Method and system for text retrieval for computer-assisted item creation

ABSTRACT

A tool, method, and system for use in the development of sentence-based test items are disclosed. The tool may include a user interface that may include a database selection field, a sentence pattern entry field, an option pane, and an output pane. The tool may search a database for one or more sentences and may generate one or more responses to the one or more sentences. The one or more sentences and one or more responses may be used to produce the sentence-based test items. The tool may allow test items to be developed more quickly and easily than manual test item authoring. Accordingly, test item development costs may be lowered and test security may be enhanced.

CROSS-REFERENCE TO RELATED APPLICATION

This patent application claims priority to, and incorporates byreference in its entirety, U.S. Provisional Patent Application No.60/642,727, entitled “Method and System for Text Retrieval forComputer-Assisted Item Creation” and filed Jan. 10, 2005.

TECHNICAL FIELD

The present invention relates generally to the field of test itemgeneration. In particular, the present invention relates to retrievingtext suitable for a sentence-based test item by scanning a corpus for aparticular search string including specific words, phrases, and/orgrammatical constructions.

BACKGROUND

Standardized testing is prevalent in the United States today. Suchtesting is used for higher education entrance examinations andachievement testing at the primary and secondary school levels. Theprevalence of standardized testing in the United States has been furtherbolstered by the No Child Left Behind Act of 2001, which emphasizesnationwide test-based assessment of student achievement.

Traditional multiple-choice assessment examinations contain one or moresections, each including a series of test items. An assessmentexamination presents several answer choices (i.e., A, B, C, D, etc.) foreach test item. An examinee is instructed to determine the best answerfrom the set of choices and indicate his or her choice on an answersheet.

Numerous educational assessments include a writing component. Sometests, such as the SAT® test (a college-entrance examination) and theGMAT® test (a business-school entrance examination) assess writingproficiency by eliciting a sample of the examinee's writing directly by,for example, requiring completion of a “constructed-response” task.Other tests, such as the GRE® test (a graduate school examination), theTOEIC® test (Test of English for International Communication), and thePRAXIS™ test (a teacher proficiency examination) assess writing (orverbal skills more generally) through test items aimed at particularaspects of grammar, usage and/or vocabulary. Questions on such tests aregenerally either multiple-choice items or free-response items based on asingle sentence. For example, the examinee might be asked to fill in anappropriate word for a blank inserted in a sentence.

Typically, test items are generated manually once test developersdetermine that a test requires a certain number of items. Items can begenerated to test for comparative construction usage, pronoun usage,redundant word usage, and the like. Upon creating such test items, theitems are often pre-tested to determine each item's difficulty levelempirically.

Accordingly, item development can be a slow process. Currently, testdevelopers must think of a scenario in which to embed relevantgrammatical material. Then, the developers must determine the particularphrasing to use. For example, in the case of items which test anexaminee's use of the comparative, the developers might have to decidewhether an item will test comparative adjectives or nouns (i.e., moreforgetful vs. more cheese) and whether to use the morphological orperiphrastic form of the comparative (i.e., faster vs. more quickly).Finally, test developers must determine an appropriate set of distractorresponses for the item.

What is needed is a method and system for more quickly generating testitems that test aspects of grammar, usage, and/or vocabulary.

A need exists for an application that searches corpora for existingsentences that might be suitable for sentence-based test items.

A further need exists for an application that generatesnatural-sounding, rather than stilted, test items.

The present invention is directed to solving one or more of theabove-listed problems.

SUMMARY

In an embodiment, a tool for use in the development of sentence-basedtest items may include a user interface that may include a databaseselection field, a sentence pattern entry field, an option pane, and anoutput pane. The tool may search a database for one or more sentencesand may generate one or more responses to the one or more sentences. Thesentences and responses may be used to produce the sentence-based testitems.

The database selection field may select the database that is used tosearch for the one or more sentences. The database may include textmaterial. The sentence pattern entry field may be used to construct asentence pattern that may be used to search the database for the one ormore sentences and the option pane may be used to construct the one ormore response patterns that may be used to generate one or moreresponses based on the one or more sentences. The output pane maydisplay the one or more sentences and the one or more responses.

In an embodiment, the user interface may include an icon for beginningthe search of the database. In another embodiment, the user interfacemay further include one or more search parameters including a minimumand a maximum sentence length and a minimum and a maximum number ofsentences returned by the search of the database. In a furtherembodiment, the user interface may include a language search icon and aphonetic search icon.

In an embodiment, a method of developing sentence-based test items mayinclude using a database selection field to select a database, using asentence pattern entry field to construct a sentence pattern used tosearch the database for one or more sentences, using an option pane toconstruct one or more response patterns used to generate one or moreresponses based on the one or more sentences, searching the database forthe one or more sentences, and generating the one or more responses,wherein the one or more sentences and one or more responses are used toform the sentence-based test items, and displaying the sentence-basedtest items in an output pane.

In an embodiment, a system for developing sentence-based test items mayinclude a processor, a processor-readable storage medium incommunication with the processor, and a display. The processor-readablestorage medium may contain one or more programming instructions forperforming the method of developing sentence-based test items.

BRIEF DESCRIPTION OF THE DRAWINGS

Aspects, features, benefits and advantages of the embodiments of thepresent invention will be apparent with regard to the followingdescription, appended claims and accompanying drawings where:

FIG. 1 depicts a first exemplary test item according to an embodiment.

FIG. 2 depicts a second exemplary test item according to an embodiment.

FIG. 3 depicts a graphical user interface for retrieving text from adatabase and generating test items according to an embodiment.

FIG. 4 depicts an exemplary method of developing sentence-based testitems according to an embodiment.

FIG. 5 depicts an exemplary system for the development of sentence-basedtest items according to an embodiment.

DETAILED DESCRIPTION

Before the present methods, systems and materials are described, it isto be understood that this invention is not limited to the particularmethodologies, systems and materials described, as these may vary. It isalso to be understood that the terminology used in the description isfor the purpose of describing the particular versions or embodimentsonly, and is not intended to limit the scope of the invention which willbe limited only by the appended claims.

It must also be noted that as used herein and in the appended claims,the singular forms “a,” “an,” and “the” include plural references unlessthe context clearly dictates otherwise. Thus, for example, reference toa “test item” is a reference to one or more test items and equivalentsthereof known to those skilled in the art, and so forth. Unless definedotherwise, all technical and scientific terms used herein have the samemeanings as commonly understood by one of ordinary skill in the art.Although any methods, materials, and devices similar or equivalent tothose described herein can be used in the practice or testing ofembodiments of the invention, the preferred methods, materials, anddevices are now described. All publications mentioned herein areincorporated by reference. Nothing herein is to be construed as anadmission that the invention is not entitled to antedate such disclosureby virtue of prior invention.

Test items from the same class may cover the same or similar content,but may vary in superficial aspects, such as wording, the order in whichdistractors and the correct answer occur in a multiple-choice item, andthe like. For example, mathematical test items dealing with thePythagorean theorem may include essentially the same content (a²+b²=c²),but with values for the variables that differ from item to item.Accordingly, it may be beneficial to generate these test items from thesame general item model. For the test items described below in referenceto FIGS. 1 and 2, which are designed to assess writing skills, an itemmodel may encapsulate a certain narrow content area, such as the use ofthe word “only” with a subject noun phrase in its focus. The choice ofsentences instantiating this pattern may vary.

Another criterion which may be used as a basis for designing item modelsis whether the variation among instances from the model aresystematically related to the difficulty level of the test item. Forexample, in the exemplary Pythagorean theorem test item model, instancesmay vary in 1) the length of the triangle's sides, 2) whether thelengths are integers or real numbers, and 3) whether the examinee isasked to solve for the hypotenuse or one of the other sides. The itemmodel may be more useful if it allows test developers to predict theeffect that each variation has on the test item's difficulty. For verbaltest items, the variables affecting the difficulty of the test item maybe, for example, the particular distractor phrases and the vocabularylevel of the text.

Each of FIGS. 1 and 2 depicts an exemplary sentence-based test itemaccording to an embodiment. Each depicted test item may include a singlesentence in the stem (the main body) of the test item and four answeroptions. The examinee may be asked to select the option which bestcompletes the sentence (when the option text is substituted for thelacuna in the stem).

FIG. 1 depicts a test item which assesses the examinee's ability toproperly use the pronoun “another” to refer to a nominal expression. Thedistractor answers (incorrect responses) may include other words andphrases which are morphologically related to the proper response(“another”).

FIG. 2 depicts a test item which tests the use of the term “only” tocontrastively focus a nominal. In this case, the distractor answers mayinclude various incorrect permutations of the correct word order.

The construction of test items to assess writing skills may be performedby using full regular expressions to identify candidate sentences. Dueto the test items being based on grammatical constructions and not onsingle vocabulary items, regular expressions may be required. Forexample, sentences suitable for satisfying the requirements of the testitem in FIG. 1 may be fairly simple to locate in a corpus. Since theitem has to do with the correct use of the word “another,” a search of adatabase that returns all sentences using that word may be performed.However, such a search may result in too many sentences being returned.A more focused search including only sentences with “another”functioning as a complete noun phrase and referring to a nominalexpression earlier in the same sentence may result in a targeted groupof sentences that are most similar to the current test item.

Similarly, a sentence similar to the stem in FIG. 2 may include the word“only,” followed by a subject noun phrase, followed by a verb. Finding asentence simply containing the word “only” may not be sufficient since“only” may be joined with phrases of other word categories (verb,adjective, etc.) or noun phrases other than a subject. Determining whichcharacteristics to use for a search may depend upon the type of testitem that the test developer desires to use.

For some test items, the appearance of a specific word within a sentencemay not be necessary. For example, distinguishing characteristics foritems testing proper verbal agreement may include whether a sentence hasa main verb and whether the sentence has multiple noun phrases earlierin the sentence with which the verb could potentially agree. In such asearch, an exact-match search for specific words may be insufficient toreturn satisfactory sentences from the corpus. Instead, a way ofsearching for sentences that match a sentence pattern may be used. Thesentence pattern may correspond to, for example, a grammaticalconstruction. Accordingly, the sentence patterns that test developersdescribe may make reference to one or more of a specific word or a moregeneral class, such as “a noun” or “a noun phrase.”

FIG. 3 depicts a graphical user interface for retrieving text from acorpus and generating test items according to an embodiment. A graphicaluser interface 300 may include a database selection field 310, asentence pattern entry field 320, an option pane 330, and an output pane340.

The database selection field 310 may be used to select a database fromwhich to select sentences for test items. As shown in FIG. 3, thepresently selected database may include articles under 1000 words inlength. The disclosed database merely exemplifies databases that may beselected. It is not intended that the present disclosure be limited tosuch a database, but rather that all possible databases containing textmaterial be included within the scope of this disclosure.

The sentence pattern entry field 320 may be used to construct a sentencepattern that may be used to search the database for one or moresentences. As shown in FIG. 3, the entered sentence pattern (“Not only{[VERB] [NOMINAL]}”) may permit a search for one or more sentencesincluding the words “Not only” followed by a verb that is followed by anominal phrase that relates to the verb. The braces (“{” and “}”) may beused to define the limits of the lacuna for each selected sentence. Thesyntax shown in the sentence pattern entry field 320 is merely exemplaryof syntax used to define sentence patterns for a search.

A test developer may use the option pane 330 to construct one or moreresponse patterns that may be used to generate one or more responsesbased on the one or more sentences. The responses may include a correctresponse and incorrect or distractor responses. As shown in FIG. 3,three distractor responses may be generated for each selected sentence.A first distractor response 332 may include the nominal phrase from theselected sentence (represented in the option pane 330 by “[2]”, i.e.,the second-listed bracketed search element). A second distractorresponse 334 may include the nominal phrase followed by the verb fromthe selected sentence (represented in the option pane 330 by “[2][1]”).A third distractor response 336 may include the word “it” followed bythe verb followed by the nominal phrase (represented in the option pane330 by “it [1][2]”). Alternate and/or additional distractor responsepatterns may be used to define the answer choices for each selectedsentence.

The output pane 340 may display the one or more sentences and the one ormore responses. As shown in the output pane 340, the one or moresentences and the one or more responses may be used to form thesentence-based test items. Each entry in the output pane 340 maycorrespond to a sentence matching the sentence pattern defined in thesentence pattern entry field 320 that is located within the corpus. Asdescribed above, the lacuna may correspond to the braced portion of thesentence pattern. Moreover, the distractor responses may correspond tothe definitions in the option pane 330. The distractor responses andcorrect response may be randomly ordered or an order may be specified.

In an embodiment, a graphical user interface 300 may further includebuttons to begin a search 350 and construct test items 352 (e.g., byinserting distractor responses). The graphical user interface may alsodefine one or more search parameters, such as a minimum sentence length(defined as, for example, the number of words in the sentence) 354, amaximum sentence length 356, and a minimum 361 and a maximum 358 numberof sentences returned by the search.

In an embodiment, the graphical user interface 300 may be used to selecta language 360 to be used for the search. Selecting the language 360 mayfurther restrict the database that is searched. For example, setting thelanguage 360 to English may result in the return of sentences fromEnglish-language texts.

In an alternate embodiment, other aspects of linguistic structure mayalso be incorporated into the graphical user interface and itsunderlying code structure. For example, a phonetic search icon 362 maybe added which may permit a test developer to search for sentenceshaving specific sound patterns. Such an embodiment may be useful inselecting sentences for a read-aloud task. Such a task may be desirable,for example, for a test of English as a foreign language.

In an alternate embodiment, different question types may be generatedfor test items. For example, each of the test items shown in FIGS. 1-3includes a stem with a plurality of responses (including distractors andthe correct response) which fill in a lacuna within the stem. In analternate test item type, no stem may be used, but each response mayinclude a complete sentence in which the distractor response or correctresponse has been inserted. In yet another type of test item, theexaminee may be required to identify the location of a grammatical errorwithin a sentence presented in the stem of the test item. Alternateand/or additional test item types are also encompassed within the scopeof the present disclosure.

FIG. 4 shows a method of developing sentence-based test items. Themethod may include using the database selection field to select adatabase 400, using the sentence pattern entry field to construct asentence pattern used to search the database for one or more sentences401, using the option pane to construct one or more response patternsused to generate one or more responses based on the one or moresentences 402, searching the database for the one or more sentences andgenerating the one or more responses 403, wherein the one or moresentences and one or more responses are used to form the sentence-basedtest items, and displaying at least one sentence-based test item inoutput pane 404.

FIG. 5 shows a block diagram of exemplary hardware that may be used tocontain and/or implement the program instructions of system embodimentsof the present invention. A bus 528 may serve as the information highwayinterconnecting the other illustrated components of the hardware. Aprocessor 520, such as a central processing unit (CPU), may performcalculations and logic operations required to execute a program. Aprocessor-readable storage medium 502, such as read only memory (ROM)518 and random access memory (RAM) 520, may be in communication with theprocessor 520 and may contain one or more programming instructions forperforming the method of developing sentence-based test items.Optionally, program instructions may be stored on a computer readablecarrier such as a digital disk, recordable memory device, or otherrecording medium, a communications signal, or a carrier wave.

A disk controller 504 interfaces one or more optional disk drives to thesystem bus 528. These disk drives may be external or internal floppydisk drives such as 510, external or internal CD-ROM, CD-R, CD-RW or DVDdrives such as 506, or external or internal hard drives 508. Asindicated previously, these various disk drives and disk controllers areoptional devices.

Each of the element managers, real-time data buffer, conveyors, fileinput processor, database index shared access memory loader, referencedata buffer and data managers may include a software application storedin one or more of the disk drives connected to the disk controller 504,the ROM 518 and/or the RAM 520. Preferably, the processor 502 may accesseach component as required.

A display interface 522 may permit information from the bus 528 to bedisplayed on a display 524 in audio, graphic, or alphanumeric format.Communication with external devices may optionally occur using variouscommunication ports 526.

In addition to the standard computer-type components, the hardware mayalso include data input devices, such as a keyboard 514, or other inputdevice 516, such as a microphone, remote control, pointer, mouse and/orjoystick.

It is to be understood that the invention is not limited in itsapplication to the details of construction and to the arrangements ofthe components or steps set forth in this description or illustrated inthe drawings. The disclosed method and system are capable of otherembodiments and of being practiced and carried out in various ways.Hence, it is to be understood that the phraseology and terminologyemployed herein are for the purpose of description and should not beregarded as limiting.

As such, those skilled in the art will appreciate that the conceptionupon which this disclosure is based may readily be utilized as a basisfor the designing of other structures, methods and systems for carryingout the several purposes of the present invention. It is important,therefore, that the claims be regarded as including such equivalentconstructions insofar as they do not depart from the spirit and scope ofthe disclosed embodiments.

1. A tool for use in the development of sentence-based test items, thetool comprising: a user interface comprising a database selection field,a sentence pattern entry field, an option pane, and an output pane,wherein the tool searches a database for one or more sentences andgenerates one or more responses to the one or more sentences, whereinthe one or more sentences and the one or more responses are used toproduce the sentence-based test items.
 2. The tool of claim 1, whereinthe database selection field is used to select the database, wherein thedatabase comprises text material.
 3. The tool of claim 1, wherein thesentence pattern entry field is used to construct a sentence pattern,wherein the sentence pattern is used to search the database for the oneor more sentences.
 4. The tool of claim 1, wherein the option pane isused to construct one or more response patterns, where the responsepatterns are used to generate one or more responses based on the one ormore sentences.
 5. The tool of claim 1, wherein the output pane is usedto display the one or more sentences and the one or more responses. 6.The tool of claim 1, wherein the user interface further comprises anicon for initiating a search of the database.
 7. The tool of claim 1,wherein the user interface further comprises one or more searchparameters comprising a minimum and maximum sentence length and aminimum and maximum number of sentences returned by a search of thedatabase.
 8. The tool of claim 1, wherein the user interface furthercomprises a language search icon and a phonetic search icon.
 9. A methodof developing sentence-based test items, the method comprising:providing a tool comprising a user interface including a databaseselection field, a sentence pattern entry field, an option pane, and anoutput pane; using the database selection field to select a database,wherein the database comprises text material; using the sentence patternentry field to construct a sentence pattern, wherein the sentencepattern is used to search the database for one or more sentences; usingthe option pane to construct one or more response patterns, wherein theresponse patterns are used to generate one or more responses based onthe one or more sentences; searching the database for the one or moresentences and generating the one or more responses, wherein the one ormore sentences and the one or more responses are used to form one ormore sentence-based test items; and displaying at least onesentence-based test item in the output pane.
 10. The method of claim 9,wherein the user interface further comprises an icon for initiating asearch of the database.
 11. The method of claim 9, wherein the userinterface further comprises one or more search parameters comprising aminimum and maximum sentence length and a minimum and maximum number ofsentences returned by a search of the database.
 12. The method of claim9, wherein the user interface further comprises a language search iconand a phonetic search icon.
 13. A system for developing sentence-basedtest items, the system comprising: a processor; a processor-readablestorage medium in communication with the processor; and a display;wherein the processor-readable storage medium includes one or moreprogramming instructions for performing a method of developingsentence-based test items, the method comprising: providing a toolcomprising a user interface including a database selection field, asentence pattern entry field, an option pane, and an output pane,wherein the user interface is displayed on the display, using thedatabase selection field to select a database, using the sentencepattern entry field to construct a sentence pattern, wherein thesentence pattern is used to search the database for one or moresentences, using the option pane to construct one or more responsepatterns, wherein the response patterns are used to generate one or moreresponses based on the one or more sentences, searching the database forthe one or more sentences and generating the one or more responses,wherein the one or more sentences and the one or more responses are usedto form one or more sentence-based test items, and displaying at leastone sentence-based test item in the output pane.